Towards data-intensive testing of a broad-coverage LFG grammar

نویسنده

  • Jonas Kuhn
چکیده

This paper addresses the problem that manual checking of output representations becomes impracticable in extensive tests during grammar development or in data-intensive applications of the grammar, like grammar-based lexicon acquisition from corpora. A method of annotating the sentences to be parsed with target expressions is proposed, using the LFG formalism itself to specify the expressions, such that the check of the actual solutions against the target speci cation is performed by the standard LFG constraint solver.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards data - intensive testing of abroad - coverage LFG grammar Jonas

This paper addresses the problem that manual checking of output representations becomes impracticable in extensive tests during grammar development or in data-intensive applications of the grammar, like grammar-based lexicon acquisition from corpora. A method of annotating the sentences to be parsed with target expressions is proposed, using the LFG formalism itself to specify the expressions, ...

متن کامل

A Comparison of Evaluation Metrics for a Broad-Coverage Stochastic Parser

This paper reports on the use of two distinct evaluation metrics for assessing a stochastic parsing model consisting of a broad-coverage Lexical-Functional Grammar (LFG), an efficient constraint-based parser and a stochastic disambiguation model. The first evaluation metric measures matches of predicate-argument relations in LFG f-structures (henceforth the LFG annotation scheme) to a gold stan...

متن کامل

Cross-Lingual Induction for Deep Broad-Coverage Syntax: A Case Study on German Participles

This paper is a case study on cross-lingual induction of lexical resources for deep, broad-coverage syntactic analysis of German. We use a parallel corpus to induce a classifier for German participles which can predict their syntactic category. By means of this classifier, we induce a resource of adverbial participles from a huge monolingual corpus of German. We integrate the resource into a Ge...

متن کامل

Treebank-Based Acquisition of Multilingual Unification Grammar Resources

Deep unification(constraint-)based grammars are usually hand-crafted. Scaling such grammars from fragments to unrestricted text is time-consuming and expensive. This problem can be exacerbated in multilingual broad-coverage grammar development scenarios. Cahill et al. (2002, 2004) and O’Donovan et al. (2004) present an automatic f-structure annotation-based methodology to acquire broad-coverage...

متن کامل

Developing German Semantics on the basis of Parallel LFG Grammars

This paper reports on the development of a core semantics for German which was implemented on the basis of an English semantics that converts LFG f-structures to flat meaning representations in a Neo-Davidsonian style. Thanks to the parallel design of the broad-coverage LFG grammars written in the context of the ParGram project (Butt et al., 2002) and the general surface independence of LFG f-s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998